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Detecting the dark matter annihilation signal from Galactic substructure, or subhalos, is an 
important challenge for high-energy gamma-ray experiments. In this paper we discuss detection 
prospects by combining two different aspects of the gamma-ray signal: the angular distribution and 
the photon counts probability distribution function (PDF). The true PDF from subhalos has been 
shown recently (by Lee et al.) to deviate from Poisson; we extend this analysis and derive the signal 
PDF from a detailed ACDM-based model for the properties of subhalos. We combine our PDF 
with a model for Galactic and extra-Galactic diffuse gamma-ray emission to obtain an estimator 
, and projected error on dark matter particle properties (mass and annihilation cross section) using 

jj^ ■ the Fermi Gamma-Ray Space Telescope. We compare the estimator obtained from the true PDF 

to that obtained from the simpler Poisson analysis. We find that, although both estimators are 
unbaised in the presence of backgrounds, the error on dark matter properties derived from the true 
PDF is ~ 50% smaller than when utilizing the Poisson-based analysis. 
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PACS numbers: 95.35.+d; 95.85.Pw 



I. INTRODUCTION 
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A wide variety of evidence points to the existence of nonbaryonic dark matter ll[ . There are three ways of directly 
confirming this hypothesis: producing dark matter or its cousins in an accelerator directly detecting dark matter 

particles impinging on the Earth in underground detectors 0, [E[ , and indirectly detecting dark matter by observing 
the products of an annihilation of two dark matter particles in space [H-|8| . The current excitement in the field stems 
from the coincidental maturity of all three of these techniques. The Large Hadron Collider began operations in 2009, a 
number of direct detection experiments have proven their ability to scale up to the one-ton level, and there are several 
' experiments (Fermi Gamma- Ray Satellite Telescope @, Atmospheric Cerenkov Telescopes PAMELA [ll|, Ice 
O'S , Cube [IH) poised to detect the indirect signal. 

For gamma-ray experiments a key challenge is to extract the dark matter signal in the presence of emission from 
(N ■ point sources, such as pulsars and AGN, and diffuse sources such as cosmic rays. One way to discriminate photons 
\Q 1 produced by a given dark matter source from the above backgrounds is to measure the energy spectrum. Photons 
generated by the annihilation of standard thermally-produced particle dark matter have a spectrum characteristic of 
quark production and hadronizationfl^, [l4|, distinguishing them from the typical power law-like behavior of other 
sources [HI, HH. A second discriminant is the angular distribution [171 - 121) . The angular distribution of photons 
produced in dark matter annihilations results from the variation of the dark matter density profile as a function of 
tp, the angle between the incoming direction and the line connecting us to the Galactic center. By contrast, the 
extra-Galactic background [22|, [23[ is more or less isotropic, and the diffuse Galactic background is predominantly 



confined to the Galactic disk 



Recently, several groups |24H26| have explored the possibility of indirect detection in the Milky Way halo by 
adding another discriminant, the probability distribution function (PDF). In their recent analysis Lee et al. [24j have 
determined the PDF of photons produced by dark matter annihilations in dark matter substructure (subhalos) in our 
Galaxy, and they have shown that this PDF is clearly distinct from a Poisson distribution. In particular, for a given 
pixel observed by, e.g., the Fermi telescope, there is an unusually large probability (unusual compared with Poisson 
expectation for the same mean number of counts) of observing multiple counts from the population of subhalos along 
the line-of-sight. 

Here we test the idea of using the PDF, together with a ACDM-based model for the scatter in subhalo properties, 
to extract the dark matter signal in the Fermi experiment. In this work we are in particular interested in answering 
the following questions: 

• Can the PDF - if known - be used as an effective tool to extract the dark matter signal? 

• Will Fermi have the statistical reach to probe a velocity-weighted annihilation cross section of 3 x 10~ 26 cm 3 
sec -1 , the canonical value for a thermally produced dark matter candidate? 
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• Does one need to know the PDF in order to analyze an experiment? I.e., if one incorrectly assumes a x 2 
distribution, will s/he be led to incorrect conclusions about the parameters under consideration? 

For concreteness, throughout, we make predictions and projections for one year of Fermi data. 

The layout of this paper is as follows. EjTT] describes the 3-component model of subhalo, diffuse Galactic and extra- 
Galactic emission that we use. Simulated maps of these components are produced in £11111 We then analyze the maps 
two different ways in ^IVI with a standard \ 2 analysis and with the exact likelihood. The former does not use the 
information contained in the PDF, while the latter analysis does use this information. Our conclusions are presented 
in SjVl 



II. THE MODEL 



We assume a three-component model for the diffuse gamma-ray background: annihilation radiation from dark 
matter subhalos, Galactic emission, and extra-Galactic emission. This simplified model neglects other contributions 
to the gamma-ray background, including point sources - both Galactic and extra-Galactic - which we assume can be 
identified and removed. We also neglect other dark matter sources, including diffuse emission from the Milky Way 
halo and from cosmological halos, as we are in particular concerned with isolating the subhalo contribution. Given 
the latitudes that we consider for our analysis this model is appropriate [13] ■ As we argue here, even our simplified 
model represents an improvement in our understanding of diffuse emission from subhalos and our ability to extract it 
using gamma-ray data. In the context of the larger goal of detecting dark matter our assumptions may be viewed as 
conservative, as we are neg lecting several possible sources of signal. 

Following Lee et al. [1J we write the probability of obtaining C< counts in bin i which is an angle ipi away from 
the Galactic center as 

P(d) = J dFP sh (F; i>i)V[EiF + Cf + C° s ; Q], (1) 

where P s h(F; ipi) is the probability of subhalos producing a flux F which depends on ipi in the pixel; V is the Poisson 
probability for obtaining d counts if the mean number of counts is equal to F multiplied by the exposure of the pixel 
in the experiment, Ei, plus the counts expected from the two background sources, Cf and C* s . We are implicitly 
assuming here that the PDF's of both background components - Galactic and extra-Galactic - are Poisson, as opposed 
to the PDF of the subhalo contribution which is captured in P s h- This is the best one could hope for when examining 
the utility of the PDF; if the PDF turns out not to matter much in our analysis, then this will be a robust conclusion. 
In the rest of this section, we describe the details of this model, now specified by P s h(F; ipi) and the expected number 
of counts due to backgrounds Cf a and C z eg . 



A. The Signal: Emission from Subhalos 

In order to calculate the counts probability distribution function given in Eq.[TJ we must estimate the flux probability 
distribution P s h(F;ipi), which depends on a description of the abundance and properties of all subhalos along the 
line of sight. Following Lee et al. [2J|, we first calculate Pi(F; ipi), the probability of observing a flux F from a single 
subhalo at angle ipi from the Galactic center: 

Pl(F) iPi) cx 0{F max - F) J di J dL sh P(L sh , £, iPi)6 (f - ^) . (2) 

Here, P(L s /j,€, ipi) is the probability of finding a subhalo emitting luminosity L s /j at a distance I from us at an 
angle ipi from the Galactic center. The step function limits the flux to be less than -F m ax since sources with larger 
fluxes will be identified as resolved point sources. Although the resolved flux limit of Fermi depends on energy 1 , for 
concreteness we choose a simple threshold of F nmx = I0 _9 cm _2 s _1 . The line-of-sight integral extends out to £ max , 
which is determined by the assumed extent of the dark matter halo. The probability P(L s h,£,ipi) can be broken up 
into a convolution of the well-studied mass function with the conditional luminosity function: 

P(L sh , £, oc fdi / dM dN }lf^; )] P[L sh I M, r(l, ^)], (3) 
J Mmin dMdV 



1 http:/ /www-glast. slac.stanford.edu/software/IS/. 
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with r(£, ipi) = \J C 2 + <^o — 2£d Q cos ipi where d@ = 8.5 kpc is the Galactocentric distance of the Sun. The assumption 



that the dark matter halo extends out to Rg = 220 kpc leads to £ max = df. 



cos ipi 



■ sin 2 ipi 



+ (R G /d G ) 2 



. The 



lower limit on the mass integral, M m i n , is determined by the cutoff scale of the subhalo mass function in the Milky 
Way halo. Supersymmetric models with WIMP dark matter candidates typically have a cutoff scale in the dark 
matter power spectrum in the range M m j n ~ 10~ 6 — 10° M Q 28-35J; motivated by these models, for all results here 
we will adopt a value of A/ m j n = 0.01A/©. We discuss the impact of varying Af m i n about this fiducial value below. 
The upper limit on the halo mass (which is not particularly relevant since the mass function falls off fairly steeply) is 
taken to be I0 10 M©. With this information, Eq. [2] can now be written as 



F) 



die 



Mr, 



dM 



dN[r(e,^)} 
dMdV 



P[L ah =AT:l 2 F\M,r{l^ t )]. 



(4) 



To complete this calculation, we need the mass function and conditional luminosity function. In Lee et al. [24j |. 
it was assumed that there is a one-to-one mapping between subhalo luminosity and the mass of a subhalo, namely 
L s f, oc M s f,. For our analysis we determine the L s y, — M s f, relation using the properties of simulated subhalos in a 
ACDM cosmology 36]. The properties of subhalos, including those that will be relevant for us such as the spatial 
distribution and the assigned gamma-ray luminosity, reflect the underlying process of non-linear structure growth. 
The complex interplay between formation redshift, time of accretion to the parent halo, and orbital and tidal evolution 
sets the characteristics of the luminosity- mass relationship of subhalos, as well as the radial distribution (see [36|). As 
a result of this process, subhalos with similar mass and Galactocentric radius will have a spread in their gamma-ray 
luminosities. 

We include this non-zero scatter by using the conditional luminosity distribution found in [36j], 



P(lnL sh \M,r) 



1 1 



exp 



[In L sh - (In L sh ) 
2ct 2 



(5) 



For a dark matter halo with a concentration of approximately c ~ 10 (model Co in |36jh the mean luminosity (L s h) 
as well as the spread about the mean luminosity a depend on subhalo mass and Galactocentric radius via 



(ln^/s^ 1 )) = 77.4 + 0.87 ln(M/lO 5 M ) - 0.23 ln(r/50kpc) + In 



/SUSY 



10- 



GeW 



(6) 



a = 0.74- 0.0030 ln(Af/10 5 M©) - 0.011 ln(r/50kpc). (7) 
The quantity /susy is the particle physics parameter 2 governing the emission rate, 

/susy = (8) 
m x 

Here the mass of the dark matter particle is m x , (av) is the thermally averaged annihilation cross section times the 
velocity, and iV 7 is the number of photons above 1 GeV emitted in the annihilation of a single dark matter pair. 
A thermally averaged cross section of (av) — 3 x 10~ 26 cm 3 s _1 leads to the correct thermal abundance of dark 
matter today so that our fiducial value of /susy = 10~ 28 cm 3 s _1 GcV~ 2 is easily accommodated in supersymmetric 
models 0|Hz|]- 

Thus the mean luminosity in Eq. ([6]) differs from that of Lee et al. [24| in several ways: it scales with mass as 
L s h oc M 87 , in agreement with simple analytic estimates [38| . as well as numerical simulation results [U [39|). 
Furthermore, the luminosity depends on the radial position of subhalos (L s h oc r ~ Q - 23 ^ a nd we also include a non-zero 
scatter (Eq. JJ}) about the mean value of the luminosity, a scatter which depends on the mass and the Galactocentric 
radius of subhalos. 

Numerical simulations predict a mass function of the form 



(M/Mq)' 
f(l + r) 2 



dNM/dMdV = A K _' ®> , (9) 



2 We use the /susy to conform to the literature, but nothing in our analysis depends on supersymmetry; all that matters is the combination 
of cross section, mass, and 7V 7 folded into /susy- 
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with (3 ps 1.9 40] . Here the radial dependence is through f = r/r s , where r s is the scale radius of the Milky Way 
halo (r s fa 21kpc). We normalize the mass function by utilizing the numerical result that roughly 10% of the mass 
of the Galactic halo (Mq = 1.2 x 10 12 M©) is in subhalos of mass greater than ~ 1O 7 M . With this assumption, the 
normalization constant is A ps 1.2 x 10 4 MQ 1 kpc -3 . Simulations also suggest that the halo distribution may be less 
cuspy near the center than the dark matter profile, and may depend on the mass of the subhalo [4lJ. This may have 
implications on the expected annihilation signal from substructure as the overall number of counts along a particular 
line of sight will be lower than expected (especially if most of the signal arrives from nearby objects). Nevertheless, 
given the current uncertainties of the level of this effect, we do not include a core in the distribution of subhalos in this 
study, but we emphasize that the issue of substructure depletion in the inner regions of the Galaxy must be addressed 
in detail in future numerical simulations. 



1.0000F 




Flux (photons/beam/year) 

FIG. 1: Probability Pi(F,ipi = 40°) of observing flux F from a single halo in a given square degree pixel. We measure flux in 
units of photons/beam/year, where a 'beam' corresponds to the approximate effective area of the Fermi telescope, A ~ 2000cm . 
The solid curve uses luminosity and mass functions from this paper with M m i n = O.OIM©, while the dashed curve uses the 
same functions with M m i n = 1O~ 6 M0. The dotted curve shows Pi from Ref. [24| with M m i n = O.OIMq, re-scaled to have the 
same mean flux as our Pi for the purpose of comparison. 

With the above ingredients we construct the probability of observing a single subhalo with flux F in pixel i, 
Pi(F,if>i), shown in Fig. Q]for tpi = 40°. In generating this figure, we have used flux units of photons/beam/year, 
where the beam corresponds to the detector area of the Fermi telescope, A ~ 2000cm 2 (the true effective area of 
Fermi is energy dependent so this value is only approximate). Of particular note in this figure is the smoother fall off 
at low flux in our model relative to the model of Lee et al. [24( (for the purpose of comparison we have scaled the Lee 
et al. 24] model so that it predicts the same mean flux as our model). As both of these models assume a sharp mass 
cut-off at the low end, the difference in fall-off at low flux follows directly from the scatter in luminosity for a given 
mass. When the low end mass cut-off, M m ; n , is changed, we see from Fig. [T] that the mean flux per subhalo decreases 
but that the shape of Pi(F) remains essentially unchanged. 

Also, note that the PDF's for both models are very similar at the high flux end as a result of competing differences 
between the mass functions and the mass-luminosity relations of the two models. For a mass function that goes as 
dN/dM cx M'P and a mass-luminosity relation such that L s h oc M a , it can be shown that for large values of F, 
PjjF) oc F~t with 7 = (1 - 0)/a - 1. For our model we have 7 = (1 - 1.9)/0.87 - 1 = -2.03 while for the Lee et al. 
[24j model they have 7 = (1 — 2)/l — 1 = —2. Thus, in both models Pi(F) is approximately proportional to F~ 2 
for large F. Physically, our less-steep mass function means that we have more high mass (and thus high luminosity) 
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subhalos than the Lee et al. [2J] model, but our less-steep luminosity function means that these subhalos are not as 
bright. The end result is that on the high flux end both models are very similar. 

We use Pi(F; ipi) to determine P s h(F; ipi), the probability of observing a total flux from multiple subhalos at angle 
ipi from the Galactic center. The two functions are related by 



At (^i)(^{Pi(F;Vi)}- 



"}• 



(10) 



where J- indicates a Fourier transform with respect to F and fi is the mean number of subhalos in a given pixel: 

r dN [r(£,i/ji)] 



dM- 



dMdV 



(11) 



Qpixei is the solid angle of a single pixel, taken here to be one square degree. Eq. [10] can be derived by assuming that 
the number of subhalos contributing to the photon counts in a single pixel is a Poisson random variable with mean fj, 
and that each subhalo emits a flux F with probability Pi(F;ipi). A detailed derivation of Eq. [10] is presented in the 
appendix of [24| . 

Finally, given P s h(F; ipi) we can construct -P(Cj), the probability of getting C counts in pixel i by applying Eq. |T]). 
Fig. [Ushows the calculated P{Ci) for the dark matter signal for one year of observation by the Fermi Telescope. Note 
that a Poisson distribution with the same mean number of counts has a significantly smaller probability of producing 
high-count pixels than the true P(C,). Also, note that despite the differences between our model and that of Lee 
ct al. [24|, both produce PDFs that are very similar. Apparently, the differences between the two models are washed 
out through the transition to P s h(F;tpi) and the subsequent discretization to produce P(Cj). The similarity between 
the two models is encouraging: it suggests that the form of P(Ci) is somewhat independent of the many assumptions 
that go into such models (e.g. the mass function, the luminosity function, M m - m , etc.), thus making our conclusions 
more robust. 

In Table [I] we show the expected number of counts for our fiducial model, as well as four other models where we 
vary the cutoff scale of the mass function and the concentration (and substructure mass fraction) of the host Milky 
Way halo. The effect of the subhalo mass function cutoff on the photon counts is due to the fact that the luminosity 
increases with mass at a slower rate than the rate at which the abundance is increasing with mass - numerous small 
(and faint) subhalos yield a higher flux than few large and bright subhalos. The effect is not very large, as decreasing 
M m i n by four orders of magnitude results in only a factor of ~ 4.5 increase in the photon counts. Still, understanding 
the low-mass cutoff scale of the subhalo mass function is important in any future interpretation of 7-ray data. 

The high and low concentration models in Table [I] refer to models C+ and C_ respectively in [36|. They represent 
host Milky Way halos with high (c > 13) and low (c < 7) concentrations. The luminosity PDF is a weak function 
of concentration, except perhaps in the very inner regions of the halo. The normalization of the subhalo mass 
function, however, depends somewhat strongly on the host concentration. High concentration host halos have a lower 
normalization of substructure / ps 0.08 (where / is the mass fraction of subhalos relative to the total halo mass) 
relative to low concentration halos which have a higher normalization of substructure / ~ 0.3. This is an outcome 
of hierarchical structure formation. High concentration host halos were formed earlier and therefore their constituent 
subhalos evolved for a longer period of time in the presence of the tidal field of the host, thus the subhalo survival 
rate is lower than in the low concentration (recently formed) hosts. As can be seen from Table H] varying the host 
concentration changes the total photon counts by at most about 60%. 



Model 


Mean Counts 


Approximate 




at V = 40° 


Total Counts 


Fiducial 


0.83 


6600 


Fiducial with M min = 10~ 6 M Q 


1.36 


29800 


Fiducial with A/ min = 1O 2 M 


0.49 


4000 


High Host Concentration 


1.57 


12100 


Low Host Concentration 


0.91 


7300 



TABLE I: The mean number of signal counts in one year per square degree at an angle ip — 40° relative to the Galactic 
center, and the approximate total signal counts on the sky at latitudes greater than b > 40° for our fiducial model and two 
models which demonstrate the effects of our lack of knowledge of the subhalo mass function cutoff scale. The low and high 
concentration models represent extreme models of the host Milky Way properties. 
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FIG. 2: P s h(C, ipi), the probability of observing C counts in pixel i from all subhalos along the line of sight where here ipi = 40°. 
The P s h{C,ipi) predicted by our model is compared with that of Lee et al. |24|. which we have scaled to have the same mean 
as our model. The two functions are very similar despite the underlying differences of the two models. Both differ significantly 
from a pure Poisson distribution with the same mean number of counts. 

B. The Backgrounds: Galactic and Extra-Galactic 

We now move on to discuss the sources of gamma-rays that we consider in our analysis in addition to the signal 
from subhalos. For ease of book-keeping, we will simply describe gamma-rays from non-subhalo sources as either 
Galactic or Extra-Galactic in origin, and now discuss each of these components in turn. 

Galactic Background- Cosmic ray interactions with atomic (HI) and molecular (primarily Hi and CO) gas are the 
source of diffuse Galactic gamma-ray emission. The emission results from the decay of neutral pions produced in 
hadronic collisions as well as inverse Compton scattering of the interstellar radiation field by electrons, and to a lesser 
extent bremsstrahlung emission from the interstellar medium. Accurately modeling this emission is challenging (42j 
and indeed crucial for the interpretation and extraction of a dark matter component in the gamma-ray background. 

In our analysis we utilize the standard diffuse gamma-ray emission model of the LAT science team 3 . We take the 
LAT team model, which is based on the observed distribution of gas as well as known point sources, as the prediction 
of the number of counts in the i th pixel, C i ' FeTlm , for one year of observation. Note that the predicted number of 
counts generated by the signal depends only on the angle ipi that separates the pixel from the center of the Galaxy. 
However the backgrounds are different: (j^- Fciml depends on both U and bi, and hence not only on ipi but also on the 
azimuthal position in the annulus. 

In each angular pixel, our total number of counts is obtained by summing over all photons with energy above 1 
GeV. We choose this energy threshold mainly because most of the photons emitted by dark matter pairs with mass 
~ 100 GeV or greater are above this energy, and also because the diffuse Galactic emission is observed to be a steeply 
falling power law near these energies. A different choice of energy threshold is trivial to incorporate into the dark 
matter model since it simply corresponds to different 7V 7 in the definition of /susy, so changing the energy threshold 



3 http: / / fermi.gsfc.nasa.gov/ssc / data/ access /lat/BackgroundModels.html 
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FIG. 3: Expected counts per one square degree pixel above 1 GeV in 1 year of Fermi data from dark matter annihilation 
in subhalos when /susy = 10 -28 cm 3 s _1 GeV -2 , the Galactic background, and the diffuse extra-Galactic background. The 
counts are given in a one square degree pixel. 

corresponds to changing /susy- 

Using the LAT team diffuse model, we simulate sky maps of diffuse gamma-ray emission. When fitting these maps 
to our model, we introduce one free parameter, given by the amplitude of the counts b g . So when considering the 
diffuse Galactic emission, our model is simply given by 

Cf 1 = & g Cf al ' Fermi (12) 

with the true value of b g = 1. 

Fig. [3] shows the counts from the diffuse Galactic model in a one square degree pixel as a function of angle from the 
Galactic center, ip, with our fiducial normalization (b g — I). Also plotted is the expected signal flux from dark matter 
subhalos in equally sized pixels. For all angles the counts from the Galactic model are at least an order of magnitude 
greater than the counts from subhalos. As expected, the signal flux falls off with increasing ip because the number 
density of subhalos decreases with distance from the galactic center (see Eq. [9]). The galactic background increases 
towards -0 = 0° and ip = 1 80° because most of the diffuse emission is from the galactic plane. 

Extra- Galactic Background- The isotropic component of the LAT team diffuse model is a result of the emission 
from extra-Galactic and instrumental sources. Over the energy range of ~ 100 MeV - 100 GeV, and for b > 40°, the 
isotropic component ascribed to extragalactic emission is well fit by a power law with index 2.41 [43|. The updated 
diffuse model indicates that above 1 GeV, the normalization of the extra-Galactic component is comparable to that 
of the dominant component of Galactic emission that arises from neutral 7r decay. In our analysis, we will simply 
model the extra-Galactic component by a number of counts with an amplitude that is allowed to be free, 

Cf = 6 efl Cf ' Ferroi . (13) 

Fig. [3] shows our fiducial normalization (b eg = I) is one in which the extra-Galactic flux is about 30 times greater 
than the subhalo flux, contributing ~ 15 counts above a GeV in one square degree pixel. 



Dark matter subhalos 
Galactic Background 
Extragalactic Background 
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III. SIMULATED MAPS 

Armed with the probability distribution in Eq. ([I]), we can generate simulated maps of the sky for a given experiment 
specified by its exposure, £7j. First, though, we construct a simpler map, shown in Fig.Uto assess "by eye" the impact 
of the assumed subhalo PDF. Both maps in Fig. @] are simulations which include the subhalo dark matter signal only. 
The central 40° is not used so it is zeroed out in all of our maps, since it will be dominated by Galactic emission. 
The top panel map in Fig. 2] is drawn from a model with the same number of expected counts in every pixel as the 
model introduced in §11. The counts in each pixel in this map, however, are drawn from a Poisson distribution. The 
bottom panel has photons drawn from the "true" dark matter PDF. Fig. [2] showed how different the subhalo PDF is 
from Poisson, and Fig. [4] illuminates this difference very graphically. There are a number of pixels with many counts 
(of order ten) in marked contrast to the Poisson map which has no high-count pixels. 

This visual impression is hidden in a map with backgrounds. Fig. [5] shows two maps with the same dark matter 
counts as in Fig. 21 but with counts from the backgrounds added in. It is no longer possible to tell the distributions 
apart by eye, so a more careful statistical probe is needed. We analyze the maps in the next section to see if the 
subhalo signal can be extracted. 

IV. ANALYSIS 

We will analyze the simulated sky constructed in the previous section in two different ways. First, we will carry out 
a simple Poisson analysis to obtain constraints on the parameters. That is, we fit the data by maximizing a likelihood 
which assumes (incorrectly) that all sources of photons are generated from a Poisson distribution, 

N p 

£ Po iS so„ (/susYj ^ beg) ^Y[V [EMfsvsr) + Cf(b g ) + Cf (b eg ); C t ] . (14) 

i=l 

In Eq. Q31 the parameters specifying the amplitudes of the background are b g and b eg ; (7, is the observed number 
of counts in pixel i (there are a total of N p pixels); Fi is the mean expected flux from dark matter annihilations in 
pixel i; and V[A;B] again is the Poisson probability of observing B counts in a pixel in which the mean expected 
number of counts is A. We emphasize that we are (purposely) doing things wrong here: we are analyzing a map 
generated from one distribution assuming incorrectly that the map is Poisson. One of the goals is to determine 
whether this flawed (yet simpler) analysis obtains the correct answer. We fit the data to the three free parameters, 
find the best fit value in this 3D space, and then identify 1-, 2-, and 3-sigma constraints by finding regions within 
which J dfsusYdb g db e g = 0.68, 0.95, 0.997. The best fit value is termed an estimator, the Poisson estimator. 

A second way to analyze these simulated maps is to use the "true" likelihood. We want to see how much better 
this approach is than the Poisson analysis. Here we use the exact likelihood, 

N p 

£ = l[P(Ci\f S vsY,b g ,b eg ), (15) 

i=l 

where P(Cj|/susY, b g , b eg ) is given in Eq. (JTJ) . Again, we can form an estimator and allowed regions for the parameters; 
we call this estimator the "true" or "exact" estimator. 

We will apply each of these estimators to the signal+background maps constructed in § IIII1 but first let us work 
on the background-free maps. There it was easy to tell the difference between the 2 PDF's by eye, so we expect to 
see considerable differences in the analyses. Fig. [S] show the results of ten runs applying each estimator. The "true" 
likelihood extracts the correct value accurately and obtains small error bars. In contrast, the Poisson likelihood 
consistently mis-estimates the value of /susy- Apparently, the Poisson estimator is misled by the many pixels with 
few counts so systematically shifts the mean number of counts lower, thereby leading to an under-estimate of /susy- 

When backgrounds are added in, it becomes less important to use the correct PDF. To see this, consider the 
constraints obtained on one simulated map using the two estimators, as shown in Fig. [7] In this realization, both 
estimators recapture the true parameter values. The errors on the parameters are larger when the simpler, Poisson 
estimator is used, but the overall impression is that using the Poisson estimator would not do appreciable damage. 

To test this further, we generated 10 such maps. Fig. [5] shows the distribution of best fit values of /susy, b eg , and 
b g from these runs. The means are both close to the true values of the parameters. The errors from the Poisson 
analysis are larger by 50 percent, so knowing the PDF does help, but the danger of a bias appears to be eliminated. 

For observation times greater than the one year that we assume in our analysis, the size of the error contours will 
of course decrease. Since /susy is proportional to the photon flux and since we have shown that the errors in /susy 
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FIG. 4: Simulated maps of the photons above one GeV in Fermi produced by the annihilations of dark matter in subhalos. 
Top panel: Simulated counts drawn from a Poisson distribution with the same number of expected events as the dark matter 
PDF. Bottom panel: Photons drawn from the dark matter PDF. 

are reasonably described by Poisson statistics, the fractional error in our determination of /susy will scale in inverse 
proportion to the square root of the exposure time. Therefore, with data covering the five year expected lifetime of 
the Fermi Telescope, we expect the error contours on /susy to be about 55% smaller than shown here. 
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FIG. 5: Same as Fig. [4] but with the addition of backgrounds from the Galaxy and unresolved extra- Galactic sources. 

V. CONCLUSIONS 

The gamma ray signal from annihilation of Galactic dark matter subhalos has a probability distribution function 
which is very different from a Poisson distribution with the same number of mean counts. This feature, initially 
explored in Ref. [HI and fleshed out here with a slightly less restrictive model, should produce in Fermi many pixels 
with zero or small number of counts but a finite set with large number of counts. We have addressed here the question 
of how this PDF will affect future analyses and concluded that, once the backgrounds are added in, a simple analysis 
which assumes a Poisson PDF is unbiased and only slightly less powerful than one which uses the full, correct PDF. 

To some extent this is good news: there is a tension between analyses which are agnostic as to the nature of the 
signal and those which assume that many of its underlying features are known and are simply fitting for parameters. 
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FIG. 6: Best fit values of /susy from multiple runs when backgrounds are not included. The true likelihood recaptures the 
input value of /susy = 10 -28 cm 3 s _1 GeV -2 , while the Poisson likelihood systematically under-estimates /susy- The error 
bars shown represent 3a confidence intervals. 



Those in the first class are more robust and believable because they are based are fewer assumptions; those in the 
second are more powerful statistically and will lead to tighter constraints on the properties of dark matter. When we 
find little loss in statistical power from dropping an assumption and moving towards more agnostic estimators, we 
should become more optimistic about our chances of extracting a signal hidden in backgrounds. This is perhaps the 
most important result of this work. 

Lingering in our discussion, and in the literature at large (see, e.g., Q), is the question how much information will 
the data contain? The answer to this is encoded in the likelihood function, and our conclusions are that one year of 
Fermi data contains enough information to detect a value of /susy = 1CP 28 cm 3 s _1 GcV~ 2 . We are not claiming 
that a robust detection of this small a signal can be expected (for an incomplete sample of analyses, see [43l - |46j ). but 
just that the information is there and we should attempt to extract it. 

Our analysis has included/assumed two types of information about the signal and backgrounds: the angular dis- 
tribution and the PDF. We have not included two other potential discriminants: the spectral shapes of the different 
components and the angular two-point functions. The former is easy to include within the formalism developed here, 
and we plan to address this in future work. The latter has been explored by a number of authors [2g, l47H5l| in the 
form of the C/'s. There is a connection between our work and the fluctuations explored elsewhere: we have implicitly 
assumed a flat C; spectrum, but one that has a larger amplitude than Poisson (because the PDF is not Poisson). 
Whether or not this set of assumptions includes all of the effects explored elsewhere is an open question. 
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FIG. 7: Constraints from one simulated map of signal and backgrounds on the 3 parameters when the underlying model has 
/susy = 1CF 28 cm 3 s _1 GeV -2 and b g = b eg — 1. Left panels: Results using the "true" PDF for dark matter. Right panels: 
Constraints assuming a Poisson likelihood. The Poisson analysis retrieves the correct result even though it assumes the wrong 
PDF; the allowed region is slightly larger if the true likelihood is not known, but there is no bias. 
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FIG. 8: Best fit values of /susy and background amplitudes from multiple runs. The true values (/susy = 10~ 28 cm 3 s _1 
GeV -2 , b g — 1, b eg = 1) are recaptured by both estimators, but the errors - especially on /susy - are about 50 percent larger 
when the Poisson estimator is used. The error bars shown represent 3cr confidence intervals. 
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